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Abstract. A (left) quotient of a language L by a word w is the language 
w~ 1 L = {x | wx £ L}. The quotient complexity of a regular language L 
is the number of quotients of L; it is equal to the state complexity of L, 
which is the number of states in a minimal deterministic finite automaton 
accepting L. An atom of L is an equivalence class of the relation in 
which two words are equivalent if for each quotient, they either are both 
in the quotient or both not in it; hence it is a non-empty intersection of 
complemented and uncomplemented quotients of L. A right (respectively, 
left and two-sided) ideal is a language L over an alphabet E that satisfies 
L = LE* (respectively, L = E*L and L = E*LE*). We compute the 
maximal number of atoms and the maximal quotient complexities of 
atoms of right, left and two-sided regular ideals. 
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1 Introduction 

We assume that the reader is familiar with basic concepts of regular languages 
and finite automata; more background is given in the next section. Consider a 
regular language L over a finite non-empty alphabet E. Let T> = ( Q , E, 6 , < 71 , F) 
be a minimal deterministic finite automaton (DFA) recognizing L, where Q is 
the set of states , S: Q x E —> Q is the transition function, q\ is the initial state, 
and F C Q is the set of final states. There are three natural equivalence relations 
associated with L and V. 

The Nerode right congruence [13] is defined as follows: Two words x and y 
are equivalent if for every v £ E*, xv is in L if and only if yv is in L. The set 
of all words that “can follow” a given word x in L is the left quotient of L by x, 
defined by x~ x L = {v | vx £ L}. In automaton-theoretic terms x~ x L is the set 
of all words v that are accepted from the state q = S(q\,x ) reached when x is 
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applied to the initial state of T >; this is known as the right language of state q, 
the language accepted by DFA V q = (Q,E,S,q,F). The Nerode equivalence 
class containing x is known as the left language of state q, the language accepted 
by DFA q V = (Q,E,8,q\,{q}). The number n of Nerode equivalence classes is 
the number of distinct left quotients of L, known as its quotient complexity [1]. 
This is the same number as the number of states in T >, and is therefore known 
as L’s state complexity [15]. Quotient/state complexity is now a commonly used 
measure of complexity of a regular language, and constitutes a basic reference 
for other measures of complexity. One can also define the quotient complexity 
of a Nerode equivalence class, that is, of the language accepted by DFA q V. In 
the worst case - for example, if D is strongly connected - this is n for every q. 

The Myhill congruence [12] refines the Nerode right congruence and is a 
(two-sided) congruence. Here word x is equivalent to word y if for all u and v 
in E *, uxv is in L if and only if uyv is in L. This is also known as the syntactic 
congruence [14] of L. The quotient set of E + by this congruence is the syntactic 
semigroup of L. In automaton-theoretic terms two words are equivalent if they 
induce the same transformation of the set of states of a minimal DFA of L. The 
quotient complexity of Myhill classes has not been studied. 

The third equivalence, which we call the atom congruence is a left congruence 
refined by the Myhill congruence. Here two words x and y are equivalent if ux £ L 
if and only if uy £ L for all u £ E*. Thus x and y are equivalent if x £ u~ 1 L if 
and only if y £ u~ l L. An equivalence class of this relation is called an atom of 
L [9]. It follows that an atom is a non-empty intersection of complemented and 
uncomplemented quotients of L. 

This congruence is related to the Myhill and Nerode congruences in a natural 
way. Say a congruence on E* recognizes L if L can be written as a union of the 
congruence’s classes. The Myhill congruence is the unique coarsest congruence 
(that is, the one with the fewest equivalence classes) that recognizes L [14]. 
The Nerode and atom congruences are respectively the coarsest right and left 
congruences that recognize L. 

The quotient complexity of atoms of regular languages has been studied in [4, 
8 ,11]. In this paper we study the quotient complexity of atoms in three subclasses 
of regular languages, namely, right, left, and two-sided ideals. 

Ideals are fundamental concepts in semigroup theory. A language L over 
an alphabet A 1 is a right (respectively, left and two-sided) ideal if L = LE* 
(respectively, L = E*L and L = E*LE*). The quotient complexity of regular 
ideal languages has been studied in [5], and the reader should refer to that paper 
for more information about ideals. Ideals appear in pattern matching. A right 
(left) ideal LE* ( E*L) represents the set of all words beginning (ending) with 
some word of a given set L, and E*LE* is the set of all words containing a factor 
from L. 
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2 Preliminaries 


It is well known that a language L C S* is regular if and only if it has a finite 
number of quotients. We denote the number of quotients of L (the quotient 
complexity) by k(L). This is the same as the state complexity, the number of 
states in a minimal DFA of L. Since we will not be discussing other measures of 
complexity, we refer to both quotient and state complexity as just complexity. 

Let the set of quotients of a regular language L be K = {I\ L ,..., K n }. The 
quotient automaton of L is the DFA T> = ( K, £, S, L, F), where 8(Ki, a) = Kj if 
a~ 1 Kj = Kj, L = K\ = e~ 1 L by convention, and F = (ATj | e £ Kij. This DFA 
is uniquely defined by L and is isomorphic to every minimal DFA of L. 

A transformation of a set Q n of n elements is a mapping of Q n into itself, 
whereas a permutation of Q n is a mapping of Q n onto itself. In this paper 
we consider only transformations of finite sets, and we assume without loss of 
generality that Q n = {1,..., n}. An arbitrary transformation has the form 

1 2 • • • n — 1 n 
il Z2 * ' * ^n—1 

where ik £ Qn for 1 ^ k ^ n. The image of element i under transforma¬ 
tion t is denoted by it. The image of S C Q n is St = The identity 

transformation 1 maps each element to itself. For k ^ 2, a transformation (per¬ 
mutation) t is a k-cycle if there is a set P = {q\,q 2 ,... ,qk} C Q n such that 
if qit. = q 2 ,q 2 t = q 3 ,...,q k -it = qk,qkt = <?i, and qt = q for all q £ P. A 
/c-cycle is denoted by (qi,q 2 , • • ■, qu)- A 2-cycle (qi,q 2 ) is called a transposition. 
A transformation is constant if it maps all states to a single state q; we denote 
it by (Q n —» q). A transformation t is unitary if p ^ q, pt = q and rt = r for all 
r ^ p; we denote it by (p —>• q). The following is well-known: 

Proposition 1. The complete transformation monoid T n of size n n can be gen¬ 
erated by any generators of the symmetric group S n (the group of all permu¬ 
tations of Q n ) together with a unitary transformation. In particular, T n can be 
generated by {(1,..., n), (1, 2), (n —> 1)}, and by {(1 ,..., n), (2,..., n), (n —> 1 )}. 

For a DFA V = (Q,S,8,q\,F) we define the transformations | w £ 
£ + } by qS a = 8(q,a) for a £ S*, and qS w = qS x S a for w = xa. This set is 
a semigroup under composition and it is called the transition semigroup of T>. 
The transformation S w is called the transformation induced by w. To simplify 
notation, we usually make no distinction between the word w £ S + and the 
transformation S w . If T> is the quotient automaton of L, then the transition 
semigroup of V is isomorphic to the syntactic semigroup of L [14]. A state q £ Q 
is reachable from p £ Q if pw = q for some w £ S + , and reachable if it is 
reachable from qi. Two states p, q are indistinguishable if pw £ F <tt> qw £ F for 
all w £ F + , and distinguishable otherwise. Indistinguishability is an equivalence 
relation on Q; furthermore, if T> recognizes a language L, we can compute n(L) 
by counting the number of equivalence classes under indistinguishability of the 
reachable states of V. A state is empty if its right language (defined in the 
introduction) is 0. 
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3 Atoms 


Atoms of regular languages were studied in [9], and their complexities in [3,8]. As 
discussed earlier, atoms are the classes of the atom congruence, a left congruence 
which is the natural counterpart of the Myhill two-sided congruence and Nerode 
right congruence. The Myhill and Nerode congruences are fundamental in regular 
language theory, but it seems comparatively little attention has been paid to the 
atom congruence and its classes. In [2] it was argued that it is useful to consider 
the complexity of a language’s atoms when searching for highly complex regular 
languages, since one would expect such languages to have highly complex atoms. 

Below we present an alternative characterization of atoms, which we use in 
our proofs. Earlier papers on atoms such as [3,8,9] take this as the definition 
of atoms, for it was not known until recently that atoms may be viewed as 
congruence classes (this fact was first noticed by Ivan in [11]). 

From now on assume all languages are non-empty. Denote the complement 
of a language L by L = £* \ L. Let Q n = {1,... ,n} and let L be a regular 
language with quotients K = {K \,..., K n }. Each subset S of Q n defines an 
atomic intersection As = flies D f]where S = Q n \ S. An atom of 
L is a non-empty atomic intersection. Since atoms are pairwise disjoint, every 
atom A has a unique atomic intersection associated with it, and this atomic 
intersection has a unique subset S of K associated with it. This set S is called 
the basis of A. 

Throughout the paper, L is a regular language of complexity n with quotients 
K \,..., K n and minimal DFA V = (Q n , £, 5,1, F) such that the language of 
state i is K t . Let As = f];es Ki n Dies K, be an atom. For any w £ £* we have 

■u; _1 As = P| ur 1 K i n P w~ 1 K i . 

ie s 

Since a quotient of a quotient of L is also a quotient of L , w~ x As has the form; 

w~ l A s = p| K t n p Kl, 

iex ieY 

where |AT| ^ |Sj and |F| < n — |«Sj, X,Y C Q n . 

The complexity of atoms of a regular language was computed in [8] using a 
unique NFA defined by L n , called the atomaton. In that NFA the language of 
each state qs is an atom As of L n . To find the complexity of that atom, the 
atomaton started in state qs was converted to an equivalent DFA. A more direct 
and simpler method was used by Szabolcs Ivan [11] who constructed the DFA for 
the atom directly from the DFA V n . We follow that approach here and outline 
it briefly for completeness. 

For any regular language L an atom As corresponds to the ordered pair 
(S, S), where S ( S ) is the set of subscripts of uncomplemented (complemented) 
quotients. If L is represented by a DFA V = ( Q, £, 5, qi,F), it is more convenient 
to think of S and S as subsets of Q. Similarly, any quotient of As corresponds 
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to a pair (X, Y) of subsets of Q. For the quotient of Ag reached when a letter 
a £ E is applied to the quotient corresponding to (X,Y) we get 


»■' n K < n n 

\i£X ieY 

In terms of pairs of subsets of Q , from (X,Y) we reach ( Xa,Ya ). Note that if 
XCY ^ 0 in (X, Y) then the corresponding quotient is empty. Note also that the 
quotient of atom Ag corresponding to (X, Y) is final if and only if each quotient 
Ki with i £ X contains s, and each Kj with j £ Y does not contain e. 

These considerations lead to the following definition of a DFA for As- 

Definition 1. Suppose V = ( Q , E, S, qi,F) is a DFA and let S C Q. Define the 
DFA V s = (Qs,E,A,(S,S),F s ), where 

- Qg = {(X,F) |X,X CQ,XnF = 0}U{T}. 

- For all a £ E, A((X, Y), a) = {6(X, a),S(Y , a)) if S(X, a) fl 5(Y, a) 0, and 
A((X,Y),a) = _L otherwise; and A(_L,a) = _L. 

- F s = {(X, Y) \ X CF,Y C F}. 

DFA Vg recognizes the atomic intersection As of L. If Vg recognizes a non¬ 
empty language, then As is an atom. 


K, = 


= n 


a 1 Ka 


iex 


n O 

ieY 


-'Ki - fl K„ 
iex 


n f k u 

ieY 


4 Complexity of Atoms in Regular Languages 


Upper bounds on the maximal complexity of atoms of regular languages were 
derived in [8]; for completeness we include these results. For n = 1 there is only 
one non-empty language L = E *; it has one atom, L, which is of complexity 1. 
From now on assume that n ^ 2. 


Proposition 2. Let L be a regular language with n ^ 2 quotients. Then L has 
at most 2" atoms. If S £ {Q„,0}, then k(As ) ^ 2" — 1 quotients. Otherwise, 



Proof. Since the number of subsets S of Q n is 2", there are at most that many 
atoms. For atom complexity consider the following three cases: 

1. S = Q n . Then A Qn = fi] ieQn Ki is the intersection of all quotients of L. For 
w £ E*, w~ 1 A Qn = n ie * K ,, where 1 ^ |X| ^ |Q„|. Hence there are at 
most 2" — 1 quotients of this atom. 

2. 5 = 0. Now A 0 = fliGQn anci url M = flier where 1 < |Y| < \Q n \- 
As in the hrst case, there are at most 2 n — 1 quotients of this atom. 

3. 0 C S C Q n . Then Ag = f| i6S K t fl f| ig g Kj. Every quotient of Ag has the 

form w~ l A s = f |iex n f |ieY K i, where 1 < \X\ < |5| and 1 < |Y| < 
n — |S|. There are two subcases: 
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(a) If ITl 7 ^ 0, then w _1 As = 0. 

(b) If X n Y = 0, there are at most (") ("~ x ) quotients of As 

of this form. This follows since (") is the number of ways to choose a set 
X C of size x, and once X is fixed, ( n ~ x ) is the number of ways to 
choose a set Y C Q n of size y that is disjoint from X. Taking the sum 
over the permissible values of x and y gives the formula above. 

Adding the results of (a) and (b) we have the required bound. □ 

It was shown in [2] that the language L n accepted by the minimal DFA V n 
of Definition 2, also illustrated in Figure 1, meets all the complexity bounds for 
common operations on regular languages. 

Definition 2. For n ^ 2, let V n = (Q n , X,5 n , 1, {n}), where Q n = {1,... , n} 
is the set of states, X = {a, b , c} is the alphabet, the transition function S n is 
defined by a = (1,..., n), b = (1, 2), and c = (n —> 1), state 1 is the initial state, 
and {n} is the set of final states. Let L n be the language accepted by V n . (If 
n = 2, a and b induce the same transformation; hence X = {a, c} suffices.) 



Fig. 1. DFA of a regular language whose atoms meet the bounds. 


It was proved in [8] that L n has 2 n atoms, all of which are as complex as pos¬ 
sible. We include the proof of this theorem following [11]. We first prove a general 
result about distinguishability of states in T>s, which we will use throughout the 
paper. 

Lemma 1 (Distinguishability). Let V = (Q,E,5,q\,F) be a minimal DFA 
and for S C Q, let T>s = (Qs, X, A, (S, S), Fs) be the DFA of the atom As- 
Then: 

1. States (X,Y) and (X',Y') ofT>s are distinguishable if X X' and Ax, Ax’ 

are both atoms, or ifY f^Y’ and Ay, Ayr are both atoms. 

2. If one of Ax or Ay is an atom, then (X,Y) is distinguishable from _L. 

Proof. First note that if Az is an atom, then the initial state of Dz must be 
non-empty, so there is a word wz such that (Z, Z)wz = {U, V) with U C F, 
V C F, i.e., (17, V) G Fs- In particular, (X,Y)wx € Fs, since Y C X. We also 
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have (X, Y)wy € Fs, since Y is sent to a subset of F , and X C Y is sent to a 
subset of F. This proves (2): if one of Ax or Ay is an atom, then one of wx or 
Wy is in the transition semigroup of V, and hence (X, Y) can be mapped to a 
final state but _L cannot. Now, we consider the two cases from (1): 

1 . X ^ X'. Suppose X' % X. Then ( X,Y)w x € F s , but (X',Y')w x F s , 
since X'\X is a non-empty subset of X and hence gets mapped outside of 
F. Thus Wx distinguishes these states. If instead we have X X', then wx’ 
distinguishes the states. Hence if Ax, Ax' are atoms, Wx and wx' are in the 
transition semigroup of T >, and the states are distinguishable. 

2. 7 / Y'. If Y' <2 Y , then wy distinguishes (X, Y) from (X',Y'); otherwise, 

Wyr distinguishes the states. As before, if Ay. A— are atoms then the states 
are distinguishable. □ 

Theorem 1. For n ^ 2, the language L n of Definition 2 has 2 n atoms and each 
atom meets the bounds of Proposition 2. 

Proof. The DFA for the atomic intersection As is T>s = (Qs, X, A , (S, S), Fs), 
where Fs = {(X, Y) \ X C {n},Y C Q n \ {n}}. The transition semigroup of 
V n consists of all n n transformations of the state set Q n . Hence ( S,S) can be 
mapped to a final state in Fs by taking a transformation that sends S to {n} 
and S to {1}. It follows that all 2" atomic intersections As, S C Q n are atoms. 
By the Distinguishability Lemma, all distinct states in Vs are distinguishable. 
It suffices to prove the number of reachable states in each Vs meets the bounds. 

If S = Q n , then As is represented by (Q n , 0), the reachable states of Vs are 
of the form (X, 0), where X is the image of Q n under some transformation in the 
transition semigroup. Since we have all transformations, we can reach all 2" — 1 
states (X, 0), 0 C X C Q n . For 5 = 0 a similar argument works. 

If 0 C S' C Q nj then for any state (X, Y) with 1 ^ X < |S|, 1 < Y < n — |S| 
and XnX = 0, we can find a transformation mapping S onto X and S onto Y. 
So all these states are reachable, and there are ' (") ( U y X ) °f them. 

In addition, J_ is reachable from (S, S) by the constant transformation ( Q n —> 1) 
and so the bound is met. □ 


5 Complexity of Atoms in Right Ideals 


If L is a right ideal, one of its quotients is E*; by convention we assume that 
K n = E*. In any atom As the quotient K n must be uncomplemented, that is, 
we must have n G S. Thus Ag is not an atom. The results of this section were 
stated in [4] without proof; for completeness we include the proofs. 

Proposition 3. Suppose L is a right ideal with n ^ 1 quotients. Then L has at 
most 2 n ~ 1 atoms. The complexity k(As) of atom As satisfies 


k(A s ) < 


J 2 n ~ 1 , 

\ 1 + El=i E^=i s| 


(n— 1 \ fn—x\ 
\x-l)\ V /’ 


if S = Q n ; 

!/0CSCQ„. 


( 1 ) 
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Proof. Let As be an atom. Since w~ x E* = E* for all w £ E*, always 

has K n uncomplemented; so if ( X, Y) corresponds to u~ 1 As, then n £ X. Since 
the number of subsets S of Q n containing n is 2" _1 , there are at most that many 
atoms. Consider two cases: 

1.5 = Q n . Then u~ 1 L = P| i x Ki, and each such quotient of A$ is represented 
by (X, 0), where 1 ^ |X| ^ n. Since n is always in X, there are at most 2 n ~ 1 
quotients of this atom. 

2. 0 C S' C Q n . Then w~ x A s = f| ieX Ki n f| ie y K ,, where 1 < |X| < \S\ and 
1 ^ |Y| ^ n— |S|. We know that if XC\Y ^ 0, then w _1 Ag = 0. Thus we are 
looking for pairs (X, Y) such that n £ X and X ft Y = 0. To get X we take 
n and choose |X| — 1 elements from Q n \ {n}, and then to get Y take |Y| 
elements from Q n \X. The number of such pairs is 0~i) ( H y X ) • 

Adding the empty quotient we have our bound. □ 

For n = 1, L = E* is a right ideal with one atom of complexity 1. For 

n = 2, L = aa* is a right ideal with two atoms L and L of complexity 2. It was 

shown in [4] that the language of the DFA of Definition 3 is most complex in 
the sense that it meets all the bounds for common operations, but no proof of 
atom complexity was given. We include this proof here. 

Definition 3. For n ^ 3, let T> n = (Q n , E, S n , 1, {n}), where E = {a,b,c,d}, 
and S n is defined by a = (1,... ,n — 1), b = (2,... ,n — 1) , c = (n — 1 —>• 1) and 

d = (n — 1 —»• n). Let L n be the language accepted by V n . If n = 3, b is not 

needed; hence E = {a,c,d} suffices. Also, let L 2 = aa* and L\ = a*. 



Fig. 2. DFA of a right ideal whose atoms meet the bounds. 


Theorem 2. For n ^ 1, the language L n of Definition 3 is a right ideal that 
has 2 n ~ 1 atoms and each atom meets the bounds of Proposition 3. 

Proof. The cases n < 3 are easily verified; hence assume n ^ 3. By Proposition 
1, the transformations {a,b,c} restricted to Q n -i generate all transformations 
of Q n - 1 - When d is included, we get all transformations of Q n that fix n. For 
S C Q n , n £ S , consider the DFA T>s, which has initial state (S,S). There is 



a transformation of Q n fixing n that sends (5,5) to the final state ({n},{l}). 
Hence As is an atom if n £ 5, and so L n has 2 n ~ 1 atoms. 

We now count reachable and distinguishable states in the DFA of each atom. 
Suppose 5 = Q n . The initial state of V$ is (Q n ,0); by transformations that 
fix n, we can reach any state ( X , 0) with {n} C X C Q n . There are 2 n ~ 1 such 
states, and since Ax is an atom if n £ X, all of them are distinguishable by the 
Distinguishability Lemma. 

Suppose 0 C 5 C Q n . From the initial state (5,5), by transformations that 
fix n we can reach any (X, Y) with 1 ^ |X| < 151, 1 ^ |V| < n — |5|, n £ X and 
X tlY = 0. There are Ey=i^ ("I*) x ) such states. For all such states 

(X, Y), we have n £ X and n £ Y, so Ax and Ay are both atoms; hence by 
the Distinguishability Lemma, all of these states are distinguishable from each 
other and from _L. The state _L is also reachable by the constant transformation 
{Qn —> fi), and so the bound is met. □ 

6 Complexity of Atoms in Left Ideals 

If L is a left ideal, then L = E*L, and w~ 1 L contains L for every w £ X*. By 
convention we let L = I\\. 

Proposition 4. Suppose L is a left ideal with n ^ 2 quotients. Then L has at 
most 2 n ~ 1 + 1 atoms. The complexity k(As) of atom As satisfies 

— n., if S — Qnj 

s£2"-\ if S = 0; (2) 

< 1 + El=i E^i S| (V) (T-i 1 ) ’ otherwise. 

Proof. Consider the atomic intersections such that 1 £ 5; then flics = ^ 
(since every quotient contains L), and there are two possibilities: Either S = Q n , 
in which case Ag = AQ n = fljeQ„ = A or there is at least one quotient, say 
Ki which is complemented. Since Kj contains L, it can be expressed as Kj = 
L U Mi, where L flMj = 0. Then the intersection has the term Ln(lU Mi) = 0, 
and As is not an atom. Thus for As to be an atom, either 1 ^ S or S = Q n . 
Hence there are at most 2 n_1 + 1 atoms. 

For atom complexity, consider the following cases: 

1. S = Q n . Then = L, and the complexity of Hg n is precisely n. 

2 .5 = 0. Now = HieQn an< ^ evei T quotient of A$ is an intersection 
fl t Y Ki, where 1 < |T| < |<5„|. There are 2 n — 1 such intersections, but 
consider any quotient I\i ^ L of a left ideal; it can be expressed as A',; = 
L U Mi, where L n Mj = 0. We have 

Ad n Af = l n LUM = LnLnW i = LnW i =T£ i . 

Thus every intersection fl iey A', which has 7^0 and does not have Ad as 
a term defines the same language as Ad fl fl igy A',;. There are 2 n ~ 1 — 1 such 
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intersections. Adding 1 for the intersection which just has the single term 
Ki, we get our bound 2™ _1 . 

3- 0 C S C Q n . Then As = D ieS n Dies w h ere neither S nor S is 
empty. If 1 € A this intersection is empty, and so is not an atom. Assume 
from now on that 1 ^ S. Every quotient of As has the form w~ 1 As = 
f] ieX K i n flier where 1 < |X| < |5| and 1 < |Y| < n - |5|. 

(a) 1 G X. We claim that u> -1 As = 0 for all w G E* . For suppose that 
there is a term Ki, i G S, and a word w G E* such that w~ 1 Ki = K\. 
Since K i C K ,, we have w~ l K\ C w _1 Ki = K\, Since also K\ C 
w~ 1 Ki because L is a left ideal, we have w~ r K\ = K\. But 1 G S', so 
ur 1 (Plies Ki) = P| iey K t has uW K[ = I<i as a term. Thus 1 G F, 
which means X fl Y ^ 0. Hence w~ 1 As = 0. 

(b) 1 ^ X. We are looking for pairs (X,Y) such that X fl Y = 0. As we 

argued in (2), K\ fl Ki = Ki for each i, so we can assume without loss of 
generality that 1 G Y. To get X we choose |X| elements from Q n \ {1} 
and to get Y we take {1} and choose |F| — 1 elements from (Q n \X)\{l}. 
The number of such pairs is X)L=i ("p 1 ) C'y-i 1 )- 

Adding 1 for the empty quotient we have our bound. □ 

Next we compare the bounds for left ideals with those for right ideals. To 
calculate the number of pairs (X, Y) such that n G X and X fl Y = 0 for right 
ideals, we can first choose Y from Q. n \ {n} and then take n and choose |A| — 1 
elements from ( Q n \ Y)\ {n}. The number of such pairs is 


n-|S| \S\ 
y —1 x=l 


n — 1 

y 


n-y-l\ 
x —1 ) 


If we interchange x and y we note that this is precisely the number of pairs 
(X, Y) such that 1 G Y and An Y = 0 for an atom of a left ideal with a basis 
of size n — |Sj. Thus we have 

Remark 1. Let I? be a right ideal of complexity n and let Ag be an atom of R, 
where 0 C S C Q n . Let L be a left ideal of complexity n and let AY be an atom 
of L. The upper bounds on the complexities of As and AY are equal. 

Now we consider the question of tightness of the bounds in Proposition 4. 
For n = 1, L = S* is a left ideal with one atom of complexity 1; so the bound 
of Proposition 4 does not hold. 

The DFA of Definition 4 and Figure 3 was introduced in [10]. It was shown 
in [7] that the language of this DFA has the largest syntactic semigroup among 
left ideals of complexity n. Moreover, it was shown in [6] that this language also 
meets the bounds on the quotient complexity of boolean operations, concatena¬ 
tion and star. Together with our result about the number of atoms and their 
complexity, this shows that this language is the most complex left ideal. 

Definition 4. For n ^ 3, let T> n = ( Q n , A, S n , 1, {n}), where E = {a, b, c, d, e}, 
and 5 n is defined by a = (2 b = (2,3), c = (n —> 2), d = (n —>• 1), 
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and e = ( Q n —» 2). I/n = 3 , inputs a and b coincide; hence £ = {a,c,d,ej 
suffices. Also, let V 2 = (Q2, {a, b, c}, 62, 1 , {2}), where a = 1, b = ( Q2 —>• 2), 
c = (Q2 —>■ !)• Let 6e t/ie language accepted by V n ; we have L2 = £*b(aUb)*. 



Fig. 3. DFA of a left ideal whose atoms meet the bounds. 


Theorem 3. For n ^ 2, the language L n of Definition f is a left ideal that has 
2 n—1 + 1 atoms and each atom meets the bounds of Proposition f. 

Proof. It was proved in [10] that L n is a left ideal of complexity n. The case 
n = 2 is easily verified; hence assume n ^ 3. It was proved in [7] that the 
transition semigroup of D n contains all transformations of Q n that fix 1 and 
all constant transformations. Recall that if As is an atom of a left ideal, then 
either S = Q n or 1 ^ S. For all S with 1 ^ S, from (S, S) we can reach the final 
state ({n}, {1}) of T>s (or (0, {1}) for S = 0) by transformations that fix 1. For 
S = Q n , let w = (Q n —>• n); then (Q n ,®)w = ({n},0) is final in Vs- Hence if 
S = Qn or 1 S, then is an atom of L n , and so L has 2" -1 + 1 atoms. 

We now count reachable and distinguishable states in the DFA of each atom. 
We know that AQ n has complexity n for all left ideals, so assume 1 ^ S. If 
S = 0, the initial state of Vs is (0,Q„). By transformations that fix 1 we can 
reach (0, Y ) for all Y with {1} CYC Q n . There are 2™” 1 of these states. Since 
Y does not contain 1, Ay is an atom, so all of these states are distinguishable 
by the Distinguishability Lemma. 

If 0 C S' C Q n , the initial state of Vs is (S, S). Since 1 ^ S, by transforma¬ 
tions that fix 1, we can reach any state (X,Y) with 1 ^ |A| ^ |S|, 1 ^ |T| ^ 

n - |S|, 1 £ X, 1 e y, and A n Y = 0. There are £jfii E,"=| S| (V) (" ~ v --i') 
such states. They are all distinguishable from each other and from _L by the 
Distinguishability Lemma, since 1 fL X, 1 £ Y imply that Ax and A— are both 
atoms. We can also reach _L from (S, S ) by any constant transformation, and so 
the bound is met. □ 
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7 Complexity of Atoms in Two-Sided Ideals 

7.1 Upper Bounds 

A language is a two-sided ideal if it is both a right ideal and a left ideal. 

Proposition 5. Suppose L is a two-sided ideal with n ^ 2 quotients. Then L 
has at most 2 n ~ 2 + 1 atoms. The complexity k(As) of atom Ag satisfies 

{ = n, ifS = Q n ; 

<2"- 2 +n—1, if S = Q n \ {1}; (3) 

< 1 + Ell G=?) (Ti 1 ), Otherwise. 

Proof. Since L is a left ideal, As is an atom only if S = Q n or S' C Q n \ {1}; 
since L is a right ideal we must also have n £ S. This gives our upper bound of 
2"^ 2 + 1 atoms. 

We know that Aq u has complexity n since this is true for left ideals. Since 
L is a right ideal, Ag is not an atom, so we can assume S ^ 0. 

Suppose As is an atom of L, with S Q n and S ^ Q n \ {1}. We proved for- 
left ideals that the number of distinct non-empty quotients of As is bounded by 
the number of pairs (X, Y), 1 < |X| < |S|, 1 < \Y\ < n — |S|, 1 £ X, 1 £ Y, 
X C\ Y = 0. Since L is a right ideal, we must also have n £ X and n &Y. There 
are (|x[--i) possibilities for X, since X must contain n and the remaining |X| — 1 

elements are taken from Q„\{1, n}. If X is fixed, there are (pyyu*) possibilities 
for Y, since Y must contain 1 and the remaining |U| — 1 elements are taken from 
(Q n \X)\ {?r}. Since Q n \ X always contains n, the size of ( Q n \X)\ {n} is 
always n — |A| — 1. Summing over the possible sizes of X and Y and adding 1 
for the empty quotient, we get the required bound. 

This leaves the case of S = Q n \ {1}. Each quotient of As has the form 

w~ 1 A s = ( p| kA HR J, 

\iex ) 

where Kj = w~ l K\ = w~ l L, and n £ X. We can view the non-empty quotients 
as states (X, {j}) of the DFA Vg for As, where V is a minimal DFA for L. We 
must have n £ X and X (~l {j} = 0, and so j £ X. Hence {n} C X C Q n \ {j }, 
and there are 2" -2 choices for X. However, for each X there are potentially n— 1 
choices for j, giving an upper bound of (n — 1)2" -2 for the non-empty quotients, 
which is not tight. We need to look more carefully at the distinguishability 
relations between states of Vg. 

For each p in Q n , define the set S(p) = {q £ Q n \ K p C K q }. The elements 
of S(p) are called the successors of p. Note that p is not a successor of itself. 

Since L is a left ideal, we have L C K t for all i £ Q n . It follows that 
w~ 1 L = Kj C w~ 1 Ki for all i £ Q n . Thus in the formula for w~ 1 Ag above, 
we have Kj C K, for all i £ X. But if Kj = K, for any i £ X, then w -1 As is 
empty. Thus Kj C K, for all i £ X, which implies X C S(j). 
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X must contain n, since L is a right ideal. Thus for each j, there are at most 
2l s 0’)l -1 distinguishable states (X, {j}). The index j can range from 1 to n — 1; 
if j = n then Xn{n} is non-empty. This gives an upper bound of 2 l ,s (AI-i 

for the number of non-empty quotients. 

This bound still is not tight, so we refine it as follows. Choose i n G S(j) 
and a non-empty set Y C S(i) \ {?r}. Then X, C K q for all q G Y, so we 

have Ki n = Ab. This means ({i,n}, {j}) is indistinguishable from 

(Y U{i, n}, {.?’}). Since Y is non-empty and does not contain n, there are at most 
2 l s WI- 1 — l possibilities for Y. 

From this we get a new upper bound for the number of distinguishable states 
(X, {j}) for a fixed j, as follows: first take our previous bound of 
Then for each i ^ n G S(j), subtract — 1 to account for the states 

(Y U { z , }, {j}) that are equivalent to ({i,n}, {j}). Our new bound is 


2lS'0')|-i _ E (2l’ sr 0)l— 1 - 1) = (\S(j)\ - 1) + - E 2 |s(i)l W 


ies(j) 

i^n 


ies(j) 

i^n 


Summing over all possible values of j, and adding 1 for the empty quotient, we 
get the following bound on the complexity of As: 


n —1 

3 = 1 


( 


\ 


(|5(j)| - 1) + 2 |SW| - 1 - J2 2 |s(i)l " 1 


V 


ies(j) 

i^n 


J 


Noting that 5(1) = {2,... , n} and |5(1)| = n — 1, we pull out the j = 1 case 
from the outermost summation: 


/ 


1 + (n — 2) + 2 n ~ 2 — E 2 |s(i)l_1 + ^2 
ies(t) 3=2 

i^n 


(|5(i)| — 1) + 2 


ISO) l-i 


V 


- E 2 

ies(j) 

i^n 


\ 


|S(i)|-l 


J 


Observe that 1 + (n — 2) + 2" 2 is equal to 2" 2 + n — 1, the bound we are 
trying to prove. We will show that the value of the rest of this formula is always 
less than or equal to zero. We pull J2j=2 2l‘ s ’0)l— 1 ou t to the front: 


/ 


2"- 2 + n -l + ^ 2 l s «l- 1 - 22 2 |s(i)| -! + ^ 


3=2 


ies(i) 

i^n 


3 =2 






V 


ies(j) 

i^n 


J 


Note that \ 2l‘ s ’0)l 1 = ^ ie s(i) 2l s WI 1 , so cancellation occurs: 


n —1 

+ n - 1 + E 
3 =2 


i^n 

l 


\ 


(|A(i)| — i) — E 2lS(i)l_1 


V 


ies(j) 

i^n 


J 
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Now, the value of the innermost summation is always greater than or equal to 
\S(j)\ — 1: for each i G S(J), i ^ n, we know that n is a successor of i, and hence 
S(i) 1 and ^ 1. Thus the value of the outermost summation is always 

less than or equal to zero. It follows that the number of quotients of As is at 
most 2 n ~ 2 + n — 1. □ 

Next we address the question of tightness of the bounds for two-sided ideals. 
For n = 1, L = E* is a two-sided ideal with one atom of complexity 1; so the 
bound of Proposition 5 does not hold. 

The DFA of Definition 5 and Figure 4 was introduced in [10]. It was shown 
in [7] that the language of the DFA of Definition 5 has the largest syntactic 
semigroup among left ideals of complexity n. Moreover, it was shown in [6] 
that this language also meets the bounds on the quotient complexity of boolean 
operations, concatenation and star. Together with our result about the number 
of atoms and their complexity, this shows that this language is the most complex 
two-sided ideal. 

Definition 5. Let n 4, and let V n = ( Q n , E, 6 n , 1, {n}) be the DFA with E = 
{a, b,c,d,e,f}, a = ( 2 ,3,..., n— 1 ), b = ( 2 ,3), c = (n — 1 —> 2 ), d = (n— 1 —> 1 ), 
e = (Q n -1 —>• 2), and f = (2 —>■ n). For n = 4, inputs a and b coincide. Also, 
let £>3 = (Q 3 , {a, b , c}, S 3 , 1, {3}), where a = 1, b = (Q 2 —» 2), c — (2 —> 3), and 
let V 2 = ( Q 2 , {a, b, cj, S 2 , 1, {2}), where a = 1 , b = ( Q 2 —»• 2), c = (Q 2 —> 1). Let 
L n be the language accepted by T> n . 


a, b, c, d, e, / a, c, e 



Fig. 4. DFA of a two-sided ideal whose atoms meet the bounds. 


Theorem 4. For n ^ 2, the language L n of Definition 5 is a two-sided ideal 
that has 2 n-2 + 1 atoms and each atom meets the bounds of Proposition 5. 

Proof. It was proved in [10] that L n is a two-sided ideal of complexity n. The 
cases with n < 4 are easily verified; hence assume n ^ 4. 
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The following observations were made in [7]: Transformations {a,b,c} re¬ 
stricted to Q n \{l, ?r} generate all the transformations of {2,..., n— 1}. Together 
with d and /, they generate all transformations of Q n that fix 1 and n. Also, we 
have ef = ( Q n —» n). 

Recall that if Ag is an atom of a two-sided ideal, then n £ S, and either 
S = Q n or 1 £ S. We know Ag n is an atom of complexity n for all left ideals 
(and hence all two-sided ideals), so assume n £ S, 1 ^ S. Then 1 £ S, and so 
from state (S, S) in Vg we can reach the final state ({n}, {1}) by transformations 
that fix 1 and n. Hence Ag is an atom for every S with n £ S, 1 0 S. There are 
2 n—2 of these atoms, as well as the atom Ag n , for a total of 2 n ~ 2 + 1. 

Consider the atom As for S ^ Q n and S ^ Q n \ {1}. In the DFA Vg , the 
initial state is (S', S), and we have n £ S, 1 ^ S. By transformations that fix 1 
and n, we can reach (X, Y) for all X,Y C Q n such that n £ X, 1 £ Y, X fl Y = 0, 
1 < \X\ < \S\, 1 < \Y\ < n- \S\. There are EL=i Ey=| S| ("lD("y-i 1 ) such 
states. Since n £ X, 1 £ X and n £ Y, 1 ^ Y we see that Ax and Ay are atoms. 
Hence by the Distinguishability Lemma, all of these states are distinguishable 
from each other and from _L. Since S / 0, we can reach _L from (S, S) by 
e f = (Qn —> n). Hence the bound is met. 

It remains to show that the complexity of Ag, S = Q n \ {1} also meets the 
bound. The initial state of Vg is ({2,... ,n}, {1}). By transformations that fix 
1 and n, we can reach all 2™~ 2 states of the form (X, {1}) with {?r} C X C 
Q n \ {1}. From ({n},{l}), we can reach n — 2 additional states ({«},{*}) for 
2^*^n — lby ea l ~ 2 . Finally, we can reach the sink state _L from the initial 
state by ef = ( Q n —> n ). This gives a total of 2” -2 + n — 1 reachable states, 
which matches the upper bound. 

To see these states are distinguishable, note that Ax is an atom if {n} C 
X C Q n \ {1}. Also, A-^jj = is an atom. Hence by the Distinguishability 

Lemma, all states of the form (X, {1}) are distinguishable from each other and 
from Y. Also, ({n}, {i}) is distinguished from ({?r}, {j}) by a n ~ l f, which sends 
the former state to the non-final state _L, but sends the latter to some final state 
({n}, {k}) with k ^ 2. And each ({?r}, {j}), lisa final state, so it is 

distinguishable from all states of the form (X, {1}), X ^ {n} and from _L, since 
they are not final. Hence all 2 n ~ 2 +n — l reachable states are distinguishable. □ 


8 Some Numerical Results 

The following tables compare the maximal complexities for atoms Ag of two- 
sided ideals (first entry), left ideals (second entry) and regular languages (third 
entry) with complexity n. Right ideals are omitted because their complexities 
are essentially the same as those of left ideals, by Remark 1. When the maximal 
complexity is undefined (e.g., because no languages in a class have atoms Ag for 
a particular size of S) this is indicated by an asterisk. The maximum values for 
each n are in boldface. The ?r th entry in the ratio row shows the approximate 
value of m n /m n -i, where ?n,; is the i th entry in the max row. 
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n 

1 

2 

3 

4 

5 

B 

|S| = o 

*/l/l 

*/2/3 

*/4/7 

*/8/15 

*/16/31 

B 

1*51 = 1 

ESS! 

2/2/3 

3/5/10 

5/13/29 

9/33/76 


l*5|=2 


2/2/3 

4/4/10 

8/16/43 

20/53/141 


l-5|=3 



3/3/7 

7/8/29 

20/43/141 


l-5|=4 




4/4/15 

12/16/76 


l*5|=5 





5/5/31 


max 

i/i/i 

2/2/3 

4/5/10 

8/16/43 

20/53/141 


ratio 

- 

2.00/2.00/3.00 

2.00/2.50/3.33 

2.00/3.20/4.30 

2.50/3.31/3.28 



n 

6 

7 

8 

9 

l*5| = 0 

*/32/63 

*/64/127 

*/128/255 

*/256/511 

l*5| = 1 

17/81/187 

33/193/442 

65/449/1,017 

129/1, 025/2, 296 

l*5| = 2 

48/156/406 

112/427/1,086 

256/1,114/2,773 

576/2,809/6, 859 

l*5| = 3 

64/166/501 

182/542/1,548 

484/1,611/4,425 

1,234/4,517/12,043 

l*5| = 4 

48/106/406 

182/462/1, 548 

584/1,646/5,083 

1,710/5,245/15,361 

l-5| = 5 

21/32/187 

112/249/1,086 

484/1,205/4,425 

1,710/4,643/15,361 

l*5| = 6 

6/6/63 

38/64/442 

256/568/2, 773 

1,234/3,019/12,043 

l*5| = 7 


7/7/127 

71/128/1,017 

576/1,271/6,859 

l*5| = 8 



8/8/255 

136/256/2, 296 

l*5| = 9 




9/9/511 

max 

64/166/501 

182/542/1,548 

584/1,646/5,083 

1,710/5,245/15,361 

ratio 

3.20/3.13/3.55 

2.84/3.27/3.09 

3.21/3.04/3.28 

2.93/3.19/3.02 


9 Conclusions 

We have derived tight upper bounds for the number of atoms and quotient com¬ 
plexity of atoms in right, left and two-sided regular ideal languages. The recently 
discovered relationship between atoms and the Myhill and Nerode congruence 
classes opens up many interesting research questions. The quotient complexity 
of a language is equal to the number of Nerode classes, and the number of Myhill 
classes has also been used as a measure of complexity, called syntactic complexity 
since it is equal to the size of the syntactic semigroup. We can view the number 
of atoms as a third fundamental measure of complexity for regular languages. 

It is known [8] that the number of atoms of a regular language L is equal to 
the quotient complexity of the reversal of L. The quotient complexity of reversal 
has been studied for various classes of languages in the context of determining 
the quotient complexity of operations on regular languages. Hence, the maximal 
number of atoms is known for many language classes. 

However, as far as we know the quotient complexity of atoms has not been 
studied outside of regular languages and ideals. For simplicity, let us call the atom 
congruence the left congruence , the Nerode congruence the right congruence , and 
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the Myhill congruence the central congruence. When computing the quotient 
complexity of atoms, we are computing the number of right congruence classes 
of each left congruence class. We can consider other permutations of this idea: 
how many right classes and left classes do the central classes have? How many 
central classes do the left classes have? These questions are outside the scope of 
this paper, but we believe they should be investigated. 
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